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Abstract 

We are given an acyclic directed graph with one source, and a subset of its edges which 
contains exactly one outgoing edge for every non-sink vertex. These edges determine a 
unique path from the source to a sink. We can think of it as a switch in every vertex, 
which determines which way the water arriving to that vertex flows further. 

We are interested in determining either the sink the flow arrives, or the whole path, 
with as few questions as possible. The questions we can ask correspond to the vertices of 
the graph, and the answer describes the switch, i.e. tells which outgoing edge is in our 
given subset. 

Originally the problem was proposed by Soren Riis (who posed the question for pyra- 
mid graphs) in the following more general form. We are given a natural number k, and 
k questions can be asked in a round. The goal is to minimize the number of rounds. 
We completely solve this problem for complete i-ary trees. Also, for pyramid graphs we 
present some non-trivial partial results. 

1 Introduction 

In this paper we consider the following problem. We are given a directed acyclic (multi)graph 
G with one source. We are given a set E* (Z E such that for every non-sink vertex v & V there 
is exactly one outgoing edge in E*. It determines a path from the source to a sink. We can 
think of it as a switch in every vertex, which determines which way the water (coming from 
the source) arriving to that vertex flows further. We imagine such graphs such that the source 
is on the top and the sinks are on the bottom. In the rest of the paper by a graph we always 
mean such a directed graph except when noted. 

We are interested in determining either the whole path P = P{E*) or only the sink the flow 
arrives, i.e. the last vertex of P{E*). The questions we can ask correspond to the vertices of 
the graph, and the answer describes the switch, i.e. tells which outgoing edge is in E*. 
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Definition 1.1. Let pa{G) = pai{G) (resp. si{G) = sii{G)) be the minimal number of ques- 
tions we need to determine the path (resp. the sink). 

Clearly si{G) < pa{G), but these can be very far from each other, for example if there is 
only one sink, then si{G) = 1 but pa{G) can be arbitrarily high. 

Soren Riis proposed the following problem. What is the minimal number of rounds we need 
if in one round we can ask k questions and our aim is to determine the path or the sink in 
the pyramid graph (for the definition see Section H])? This motivates the study of the following 
more general question. 

Definition 1.2. Fixing a k, pak{G) is the minimal number of rounds we need to determine 
the path, if in each round we can ask k questions. Similarly, sik{G) is the minimal number of 
rounds we need to determine the sink, if in each round we can ask k questions. 

As an additional motivation, let us consider a game and two possible strategies given as 
black boxes, i.e. we can evaluate in every possible state what the strategies do. To represent 
the game with a directed acyclic graph, it is enough represent every valid (turn number, state) 
pair with a different vertex, and the valid steps are represented by the directed edges. For every 
finite game, this is a finite acyclic directed graph. Now we can simulate a match between two 
players using the respective strategies. To find out which strategy is the winner of a match, 
one can go step by step and ask what happens in every actual situation. Note that it does not 
matter if the black box strategies use randomness or not. This process is a very important step 
of Monte-Carlo type algorithms, when one needs to quickly simulate matches between strategies 
[I1I21I3]. 

However, suppose we are given multiple processors. A solution to the original sink-search 
problem for the above defined graph gives an optimal parallel algorithm to determine the final 
state of the match. Clearly, such an algorithm determines the winner of the match as well. 
Summarizing, investigating our problem can help find faster algorithms for simulating matches, 
if parallel computations are allowed. (We note that we do not think it could help a lot. We 
were able to find an example where the optimal algorithm is much faster than the trivial one, 
but it was a very special graph. Also in case of games usually the graph is very large, which 
makes it hard to find a good non-trivial algorithm for our problem.) For further motivations 
related to random walks see the beginning of Section HI 

In the next section we examine si{G) and pa{G) in directed acyclic graphs. In Section [3] 
we consider sik{G) and pak{G) on trees and completely solve the problem on ci-ary trees. In 
Section H] we consider the problem on the pyramid graph. In the last section we conclude our 
paper with some additional remarks and open problems. 
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2 Search in directed acyclic graphs 



We will show a process how to transform the graph G into a graph G' such that si{G) = si{G') 
and si{G') can be easily determined. We also show how to transform G into a graph G" such 
that pa{G) — pa{G") and pa{G") can be easily determined. 

But at first let us consider a trivial algorithm which finds both the sink and the path: at 
first the source is asked. Then the answer tells us which vertex is the next on the path. That 
one is asked in the next turn, and so on. This gives us the following simple observation. 

Observation 2.1. For every (multi)graph G, si{G) < pa{G) < 1{G), where 1{G) is the length 
(the number of edges) of the longest path. 

We start with examining si{G). We can simply forget about multiple edges then, hence we 
suppose G is simple. Obviously it's useless to ask a vertex with outdegree 1. We will define a 
graph G" with no such vertices such that if we find the sink in G' it gives us the sink in G (in 
fact it is the same vertex). We introduce the following merging operation: for a set of vertices 
M, we get G{M) from G by deleting the vertices M and introducing a new vertex m, if there 
was an edge between a vertex v in G \ M and a vertex w in M then we put an edge in the 
new graph between v and m with the same orientation as in G. If multiple edges come into 
existence, we consider them as one edge. Edges between vertices of G \ M stay untouched. 
Acyclicity could be ruined by such an operation but anytime we do such an operation, it will 
be easy to see that acyclicity remains true. 

If a vertex x of G has exactly one out-neighbor y, then G{{x, y}) remains acyclic and 
si{G{{x,y})) = si{G). By merging this way vertices with outdegree 1 with their outneighbors 
as long as it is possible, we get a graph G' with vertices all having outdegree minimum 2 (except 
the sinks) and for which si{G') = si{G). Although it is not relevant for us, it is easy to see 
that G' does not depend on the order in which we process the vertices. The merging procedure 
defines a map from the vertices of G to the vertices of G", hopefully without causing confusion, 
we will refer to the image of some vertex a; of G also as x. By the procedure, every vertex of 
G' has at least 2 out-neighbors or it is a sink. 

Observation 2.2. // a vertex y can be reached from a vertex x in G, then y can be reached 
from X in G' as well. 

The main result of this section is that after merging every vertex with outdegree 1 to get 
G", si{G') can be determined easily: 

Theorem 1. Suppose there is no vertex with outdegree 1 in a simple graph G. Then si{G) — 
1{G), where 1{G) is the length (the number of edges) of the longest path. 
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Proof. We need some preliminary observations. Let us examine what happens when a question 
is answered. Clearly xy G E* is equivalent to the following: all the other edges starting from 
X are not in E*. But then we can simply delete these edges to get a new graph and then the 
outdegree of x becomes 1, hence we can merge x with y to get another graph which we denote 
by Gxy, ie. this answer reduced the problem to finding the sink in Gxy, which is the graph 
we get by deleting from G all the edges going out from x except xy and then merging x and 
y. Thus, if the first question is x and the answer is xy, then si{Gxy) additional questions are 
needed to find the sink. 

Definition 2.3. A directed path is called full if it ends in a sink. 

Let us consider G'^y. By definition x and y are merged. Then if a vertex z had only two 
outneighbors, x and y, it gets merged with them. Then if all the outneighbors of another vertex 
are among x, y and z it also gets merged with them, and so on. Let M be the set of vertices 
M of G for wich every full path starting at some vertex u in M contains x or y. By the above 
argument, for this M, G'^y = Gxy{M). 

Observation 2.4. // in G there is a full path starting at some vertex z and avoiding x and y, 
then when merging x and y, z cannot he among the merged vertices M . 

Note that if G does not have vertices with outdegree 1, then if a vertex m is among the 
merged vertices M, then there are paths to both x and y from u, hence by acyclicity of G there 
are no paths from x to u. 

Observation 2.5. // there is a path from x to z in G, then when merging x and y, z cannot 
be among the merged vertices M. 

Now we can start to prove the theorem. 

By Observation 12.11 si(G) < pa{G) < 1{G), hence it is enough to prove 1{G) < si{G). We 
prove it by induction on k = 1{G). The case k = 1 can be easily seen. 

Now we describe the strategy of the adversary. Let P be a path of length /c, consisting of 
the edges X1X2, X2X3, . . . , XkXk+i- 

Case 1. Suppose that the first vertex asked is Xj, and there is an edge Xj-iXj+i in the graph. 
Then the adversary's answer should be an edge Xiy, where y 7^ Xj+i. It means we continue with 
the graph G^^y It might contain vertices with outdegree 1, hence we need to determine G'^_y. 

By Observation 12.41 the vertices Xj (j 7^ i) do not get merged in G'^.y, thus there is a path 
of length A; — 1 in G'^_y, containing xi, . . . , Xj+i, . . . , Xk+i in this order. Then by induction 
at least k — 1 additional questions are needed in G'^_y. 

Case 2. The first vertex asked is Xi but there is no edge Xi-iXi+i in the graph G. Then 
the adversary's answer should be the edge XiXi+i. No vertex Xj with j > i gets merged by 
Observation 12. 5[ Also, from a;j_i there is an edge going out which is not the edge Going 
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along this edge we can find a path Q. First we claim that for any choice of such a Q, it does not 
contain Xj. Indeed, otherwise the original path P minus the edge Xi^iXi plus this path Q would 
give a longer path in G, a contradiction. Further, Q can be chosen such that it avoids Xj+i as 
well. Indeed, every outdeegree is at least 2, so we always have at least one choice different from 
Xi+i to continue. This way, for every vertex Xj with j < i there is a full path starting at xj 
and avoiding both Xj and Xj+i, thus by Observation 12.41 none of these vertices gets merged in 
^'x,x^+i- Hence the path xi, . . . , Xj+i, . . . , Xk+i is a fc — 1 long path in G^.^..^^, the induction 
can be applied. 

Case 3. The first vertex asked, x is not in the path P. If it has an outneighbor y not on 
the path, that should be the adversary's answer. Then clearly the full path P is in G'^^ and 
by Observation 12.41 none of its vertices are merged. Hence by induction k more questions are 
needed. 

Thus we can suppose that all the outneighbors are on P. Let Xi be the first and xj be the 
last among them. Then the answer of the adversary is xj. Again, by Observation 12. 5[ no xi 
with / > j gets merged. Similarly as in Case 2, any full path starting at Xi must avoid x and 
also we can choose such a path which avoids Xj as well. This path Q shows by Observation 12.41 
that Xi won't be merged. By adding to Q the appropriate part of the path P, for any xi, I < i 
we can build a path avoiding both x and Xj, thus showing that these vertices won't be merged 
neither. Finally, for a vertex xi with i < I < j, xi can be reached by a directed path from x 
(starting with the edge xxi and then going along P), thus by Observation 12.51 such a vertex 
cannot be merged neither. 

As none of the vertices of the path are merged in G'^^ , by induction k more questions are 
needed. □ 

Corollary 2.6. For any (multi) graph G, si{G) = si{G') = 1{G'). 

Theorem [1] implies also that for a simple graph G, pa{G) = 1{G), if all non-sink vertices of 
G have outdegree at least 2. We will prove that this holds even for multi-graphs. 

We examine the path-search problem for multigraphs, hence we can claim again: it's useless 
to ask a vertex with outdegree 1. We introduce the following modified merging operation for 
multigraphs: for a set of vertices M, we get G[M] from G by deleting the vertices M and 
introducing a new vertex m, if there was an edge between a vertex f in G \ M and a vertex w 
in M then we put an edge in the new graph between v and m with the same orientation as in 
G. Now, differently from the previous merging operation, if parallel edges come into existence, 
we consider them as parallel edges. Edges between vertices of G \ M stay untouched. 

Merging a vertex x with outdegree 1 with its outneighbor y to get G[{a;, ?/}], the graph 
remains acyclic and pa{G[{x,y}]) = pa{G). By merging this way vertices with outdegree one 
with their outneighbors as long as it is possible, we get a graph G" with vertices all having 
outdegree minimum 2 (except the sinks) and for which pa{G") = pa{G). 
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Theorem 2. Suppose there is no vertex with outdegree 1 in a (multi)graph G. Then pa{G) = 
liG). 

Proof. We get pa{G) < 1{G) from Observation 12.11 so we only have to deal with the other 
direction. We examine what happens when a question is answered. Having xy G E* is still 
equivalent to the following: all the other edges starting from x are not in E*. But then we can 
simply delete these edges to get a new graph and then the outdegree of x becomes 1, hence we 
can use our new merging operation G[{a;,|/}] to get another graph which we denote by G^^, 
ie. this answer reduced the problem to finding the sink in G^^, which is the graph we get by 
deleting from G all the edges going out from x except xy and then merging x and y. Thus, if 
the first question is x and the answer is xy, then pa{G^y) additional questions are needed to 
find the sink. Note that [G^^)" = G^^ . Now, if the asked vertex is from the fixed maximal path 
P, then the adversary answers the edge of the path, otherwise it answers arbitrarily. Thus, 
after any question and answer only at most one edge of P gets merged, thus the length of the 
maximal path reduces by at most one. From this by induction it easily follows that the size of 
the maximal path is a lower bound to the number of questions needed. □ 

Corollary 2.7. For any (multi) graph G, pa(G) = pa{G") = 1{G"). 

Another interesting question is the case when the graph contains a directed cycle. In this 
case it can happen that following the fiow we get stucked in a directed cycle and never go to a 
sink. 

We begin with the path-search problem. Let us call a set of edges a generalized path if it 
contains a directed path and possibly an additional edge which goes from the last vertex to 
a vertex already on the path. Let l'{G) denote the length (number of edges) of the longest 
generalized path starting at the start vertex. 

One can easily see that if a multigraph G does not contain any vertices with outdegree 1, 
then pa{G) = l'{G). Indeed, we can copy the proof of Theorem [2J Consider a graph G and a 
generalized path P of length l'{G). No matter what the question and the answer are, at most 
two vertices are merged, hence at least l'{G) — 1 vertices of P remain. Again, the adversary 
answers in a way that for the vertices of the fixed maximal generalized path P the answer is 
always an edge of P. Then the remaining vertices of P form a generalized path of length at 
least l'{G) — 1, and the induction can be applied. Similarly, Corollary 12 . 71 remains true as well. 

On the other hand, in case of si{G) we encounter problems as our basic operation for 
handling an answer cannot be applied, as we might lose some information at every merging. 
For example let us suppose z has only two outneighbors, x and y. Then z gets merged with 
them in Gxy However, if z, x and y are part of the cycle where the fiow ends in this order, then 
after merging x and y there is no way to differentiate this cycle from the other cycle where y 
comes immediately after z. 



6 



3 Search in trees 



We consider trees as rooted directed trees where the edges are directed away from the root. 
For complete d-arj trees on n + 1 levels (denoted by Td{n)) the obvious algorithm that asks as 
many complete levels as possible from the beginning, is the best possible: 

Theorem 3. pakiTdin)) = sik{Td{n)) = \n/ log'^k] , where log^ is defined as the biggest integer 
i for which 1 + d + d'^ + . . . + d^~^ < k holds. 

Proof. Clearly, if we ask in each round the first as many full levels as possible, in each round 
Pmax gets longer by log^ in worst case, thus in \n/ log^ k] rounds we can easily find the path. 
We give a strategy to the adversary, so that his answers force that from k queries of any round, 
only at most log^ will be on the final path P. This way, as the final path has length n, there 
were indeed at least [n/log^A;] queries altogether. 

Now fix a round i, define S as the set of vertices asked in this round. Now take a vertex 
V E S asked in this round. This vertex has d children, taking these as roots, they define d 
maximal subtrees of T, the jth such tree containing rj further vertices of S. For v, we answer 
the edge that goes to a child that has the minimal rj value. This defines an answer to every 
vertex in round i. 

We claim that this way at most log^ vertices from round i will be on the final path P. 
Suppose that there are x vertices from round i that are on P. In reverse order we denote them 
by qo,qi, . . . qx-i, where qx-i is the one closest to the root. Define Qj as the maximal subtree of 
T having qj as its root. We prove by induction on j that IQjflS'l > l + d+d'^ + . . . + d^ . For j = 
this is obvious. For a general j we define the trees Ri, . . . Rd, where Ri is the maximal tree having 
q/s rth child as its root. Wlog. assume that the edge of P at qj goes to the root of Ri. Now by 
the way the adversary answers, we know that ri = Rid S is minimal for / = 1 and as -Ri contains 
the vertex and everything below it, by induction |_RinS'| > IQj-iHS'l > l+d+d'^ + . . .+d^~^. 
Summing this up, IQiOS] = l + \Rir]S\ + \R2r]S\ + . . .\Rdr]S\ > l + d{l + d + d'^ + . . . + d^-^) = 
l + d + + . . . + c?^' as claimed. Finishing the proof, A; = \S\ > \Qx-i(^S\ > 1 + d + d^ + . . . + d''-^ 
implies that x < log^ k. □ 

As 1 + 2 + 2^ + . . . + 2*^^ = 2* — 1, for d = 2 the formula in Theorem [3] can be simplified: 

Corollary 3.1. For a complete binary tree on n levels, pak(T2{n)) = sik{T2{n)) = [n/Llog2(A; + 
1)J1 

We remark that for general trees, the obvious algoritm is not always the best possible. For 
example take the tree that starts from the root with a path piP2 ■ ■ - Pn of length n, each pi 
having one further child except Pn, where there is a complete n level binary tree with root p„. 
Now for k = 2 the obvious algorithm asking always close to the root, increases Pmax by 2 in 
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each round until reaching p„, then while processing the binary tree, in each round it can only 
increase Pmax by 1? thus it finishes approximately in 3n/2 rounds. On the other hand, if in 
each round we ask one vertex from the path and one vertex from the remainder of the binary 
tree, in each round the path shortens by one and the binary tree has one less levels. Thus in n 
rounds we can finish with both parts. 

4 Search in pyramid paths 

The pyramid graph Py{n) is a directed graph defined in the following way. Py{n) has = 
n(n + 1)/2 vertices on n + 1 levels, for 1 < i < n + 1 the zth level having i vertices Wj^i, Wj^2 • • • Vi^i, 
and from every vertex Vij where 1 <i <n and 1 < j < i, there is a left outgoing edge going to 
Vi+ij (its left child) and a right outgoing edge going to fj+i^+i (its right child). Py{n) has one 
root on the top, f i^i and n + 1 sinks on the bottom, the vertices on the (n + l)th level. 

Let us suppose we are given a one dimensional random walk and we want to find either the 
endpoint, or the whole walk. It is not obvious what search model makes sense here. If we can 
ask which way it goes in each step, then clearly we need to ask every step, and that is enough, 
the order of the questions do not matter. But suppose that a question is the following: which 
direction does the walk go at the ith step if it is in the jth position? 

Clearly it is equivalent to our model on the pyramid graph. 

Observation 4.1. For arbitrary k and n we have 
(a) pai{Py{n)) <pak{Py{n)), 
(h) pak+i{Py{n)) <pak{Py{n)), 
(c) sik+i{Py{n)) < sik{Py{n)). 

It is trivial that iik < N then in both cases we need at least 2 rounds. I.e. the non-adaptive 
version (having 1 round) of both problems needs N queries. Indeed, for k < N there is a vertex 
that we did not ask and so the adversary can answer in a way that the path leads to this vertex, 
everything below this vertex is in a left state and so the state of this non-asked vertex would 
determine the path and the sink as well. 

The fully adaptive version of the problem is again pretty simple. 

Claim 4.2. sii{Py{n)) = pai{Py{n)) = n. 

It follows from Theorem [1] but we also give a simple proof specific to pyramid paths. 
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Proof. In n rounds it is easy to determine the path and its sink. First we ask the root and 
then according to the answer, its left or right outgoing neighbor. We continue this way, in the 
ith round determining the ith edge of the path, finally asking a vertex on the nth level, thus 
determining the whole path. 

Suppose now that we asked less than n questions, then there is a level i < n, from which 
we did not ask any vertex. The adversary answers always left and also at the end he tells us 
that any vertex not on level i is in left state. Thus we know which vertex of level i is on the 
path, and the state of this vertex (which we don't know) would determine the path and also 
its sink. □ 

Conjecture 4.3. sisiiPyin)) = pasi{Py{n)) = In/l] if si = 1 + 2 + . .. + 1 for some I. 

The upper bound holds by a simple algorithm: 

Claim 4.4. sis,{Py{n)) < pasi{Py{n))] < \n/l] if si = 1 + 2 + . . . + I for some I. 

Proof. The algorithm is recursive. In the first round we ask the si vertices that are on the first 
/ level, thus we will know the first I edges of the path and also we know the vertex u on the 
(/ + l)st level that is on the path. Now take the new pyramid graph with n levels with root u, 
by recursion we can find the path here in \{n — 1)//] = In/l] — 1 rounds. This path together 
with the first / edges gives the path we were looking for in the original pyramidal graph and we 
had 1 + [n//] — 1 = [n//] rounds as needed. To start the recursion we need that if n < / then 
one round is enough. This is trivially true as in one round we can ask all the vertices that are 
not sinks, and so we can determine the path. □ 

The main result of this section is the following. We give a general lower bound that verifies 
Conjecture 14.31 for / = 2 (i.e. k = si = 3) and solves the case k = 2. 

Theorem 4. For arbitrary k, pak{Py{n)) > sik{Py{n)) > [^l^'^l ■ 

Proof. We give two different proofs. While processing an algorithm which finds the path/sink, 
there is always a maximal partial pyramid path Pmax that we know from the answers until now, 
i.e. the path determined by the state of the already known vertices (note that Pmax is changing 
by time). The basic idea in both proofs is that in each round there is only one question which 
immediately makes Pmax longer by one and for the rest of the questions, only pairs of them can 
determine one more edge in Pmax- In both proofs the adversary has the following answering 
scheme. In each round he answers for the k asked vertices in reverse order of their height i.e. 
he first answers for the one which is on a level with a biggest index (if there are more asked 
vertices on the same level, then their order does no matter), etc.. This way at most one vertex 
per round is the endvertex of Pmax when it is asked. 
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First proof of the lower bound. If a vertex is not an end vertex of Pmax then we just answer 
left. If the asked vertex v is an endvertex of Pmax then we do the following. Let ?;'s left child 
be u and its right child be w. Compute the length of the path starting from u determined by 
the already known states of vertices (when we reach a vertex with unknown state, that's the 
end of the path, eg. it may be already u if we don't know it's state). Similarly, the length of 
the path starting at w is Iw Now we answer left for the state of v if < lu and right otherwise 
i.e. we choose to go in the direction where the continuation of the path will be shorter. 

When analyzing this method we just concentrate on the vertices which at the stage when 
they are asked, are the endpoints of the current Pmax- As already mentioned, we consider only 
one such point in each round. After our answer to a vertex v the path gets longer by at least 
one. If by more than one, then wlog. we have chosen u, its left child and so < i^^, where w is 
its right child. As for non-endpoints of Pmax we always answer left and the two paths starting 
at u and w contain only such vertices, they are completely disjoint. I.e. for all but one edge 
of the new Pmax determined in this round, we found two vertices for which the state was asked 
already. As Pmax is increasing, in each round we find new such pairs of vertices. It is easy to 
see that doing this the sink is determined if and only if the pyramid path is determined as well. 
Suppose now that after m rounds the whole pyramid path is determined, i.e. all n edges of it. 
In each round there was at most one endvertex of Pmax asked, which means that at most m 
edges were determined by them and for the rest of the edges we found two asked vertices for 
each. Thus together there where at least m + 2{n — m) — 2n — m questions. Thus we had at 
least m > {2n — m)/k — {2n — m)/k rounds which implies m > -j^n as needed. 

Second proof of the lower bound. If a vertex v is not an endvertex when it is asked then we 
check if there is another vertex v' on the same level with known state. If there is, then we give 
the same answer for v as we gave for v' and additionally we tell that the state of every vertex 
on this level is the same. Otherwise we give an arbitrary answer and also say that either there 
will be one more asked vertex in this level or we will avoid v. If is an endvertex of Pmax, then 
we determine the first level under it for which we did not tell the state of every vertex on that 
level. On this level either there is no asked vertex, then our answer to v is arbitrary or there 
is a vertex q which was already asked. Now either answering left or right to v will make sure 
that the endvertex of the new Pmax is on the same level as q but a different vertex. Thus the 
question when we asked q became useless, wlog. we can assume that there is no such q. This 
way questions which were endvertices of Pmax determine one edge in Pmax and pairs of the rest 
of the vertices determine a whole level, i.e. one edge in Pmax- It is easy again to sec that doing 
this the sink is determined if and only if the pyramid path is determined as well. The same 
computation as in the first proof yields the desired lower bounds. □ 

Corollary 4.5. pa2{Py{n)) = si2{Py{n)) — 
pa^{Py{n)) = siz{Py{n)) = [^n]. 
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Proof. Theorem m implies si2{Py{n)) > \^n] and siz{Py{n)) > \^n]. We now need to give 
algorithms for finding the paths, that achieve these bounds. For k = 3 the algorithm in Claim 
14.41 can be applied. For k = 2 first we ask the root Vi^i and v^^2- Wlog. the root is in left state. 
Now in the second round we ask V2^i and ^3^1. After these two rounds we will know the first 3 
edges of the path and then we can proceed by recursion (taking the endvertex of this 3 long 
path as the new root). □ 

Pyramid graphs can be easily generalized to d- dimensions, see eg. the paper of Sun et al. 
[3] In this paper a pyramid graph is represented on the non-negative part of the 2-dimensional 
grid with the origo being its root. In a similar way a d-dimensional pyramid graph is a part of 
the (i- dimensional grid. In the following we give a generalization of pyramid paths that includes 
the (i-dimensional pyramid path. A generalized pyramid graph GPyd{n) is a directed graph 
having the following properties. GPyd{n) has its vertices on n + 1 levels such that the first 
level has one source vertex and the last level contains only sinks. From any vertex Vi on level 
i < n, there are d outgoing edges to level i + 1, and between each two levels i and i + 1 there 
is a matching Lj that matches level i to level i + 1. As a consequence, on each non- first level 
there are at least d vertices. 

Theorem 5. For any generalized pyramid graph GPyd{n) and arbitrary k, pak{Py{n)) > 
st,iPyin))>\^^n]. 

Proof. Both proofs of Theorem H] easily generalize to this setting. Here we present a proof using 
the second method. We refer to the edges of all the Lj's as left edges. If a vertex v is not an 
endvertex when it is asked then we check if there are at least d — 1 another vertices on the 
same level with known state. If no, then we give answer left (i.e. the edge from the appropriate 
matching) and also say that either there will be one more asked vertex in this level or the final 
path won't go through v. If there are at least d — 1 vertices already on this level with known 
state, then we again give answer left for v and additionally we tell that the state of every vertex 
on this level is left. If v is an endvertex of Pmax which is on level i, then we determine the first 
level j under it for which we did not tell the state of every vertex on that level. If on this level 
there are at most d — 1 asked vertices, then our answer to v is such that the new endvertex of 
Pmax is a vertex on level j that was not yet asked. This can be done, as the at least d different 
choices for the state of v all yield to different endvertices on all the levels from i to j, as on 
every level every known state is a matching edge and the matching edges never go to the same 
vertex. 

It is again easy to conclude that in each round there is only one question which immediately 
makes Pmax longer by one and for the rest of the questions, only (i-tuples of them can determine 
one more edge in Pmax- Simple computation gives the lower bound sik{GPyd{n)) > jr^^n. □ 
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We also remark that Theorem [J] cannot be improved as there are generahzed pyramid graphs 
for every d where equahty holds. Indeed, take the n + 1 level generalized pyramid graph having 
d vertices on each non-first level for which every non-sink vertex is connected to every vertex 
on the next level. This graph is uniquely determined by n and d. Now in this graph an obvious 
algorithm is to ask in each round the endvertex of Pmax and as many complete levels under it 
as possible. This way in each round Pmax gets longer by 1 + [{k — l)/d\ = [{k — 1 + d)/d\, 
thus we can determine the path in at most [n/([ ^~^"'"'^ J)] rounds, If A; — 1 is divisible by d then 
this upper bound matches the lower bound of Theorem [51 

5 Remarks 

Let us consider again the problem of pak{G) and sik{G) for directed acyclic graphs. Obviously 
the same preprocessing as in the case k = 1, replacing G by G' or G", is useful in general. From 
now on we suppose that G does not have vertices with outdegree 1 (which is always true for G' 
and G"). 

One could ask how far si{G) and sikiG) (or pa{G) and pak{G)) can be. Obviously si{G) < 
ksik{G) as the same at most ksik{G) questions which were used to find the sink in the case of 
sik{G) could be used one-by-one to find the sink in the case of si{G). An example where this 
bound is achieved is the graph Hi consisting of a directed path of length kl with each vertex 
on the path having another out-neighbor, which is a sink. More precisely let a;i, . . . , Xki+i be 
vertices of Hi such that XjXj+i G E for every i < kl. Additionally, every Xi with i < kl has 
a child, which is a sink. One can easily see that in the worst case (that can be forced by the 
adversary) the sink is found if and only if xi, . . . ,Xki have been asked, hence si{Hi) = kl and 
sik{Hi) = I. The same is true if we want to find the path. 

An example where si{G) and sik{G) (or pa{G) and pak{G)) are close to each other is 
the complete fc-ary tree from Section [31 It follows easily from Theorem [3 that si{Tk{n)) = 
sik{Tk{n)) = pa{Tk{n)) = pak{Tk{n)) = n. 

More generally, one could ask how far sim{G) and sik{G) (or pa^niG) and pak{G)) can be 
for any constants m < k. Obviously sim{G) > sik{G), and similarly to the arguments used in 
the case m = 1, simiG) < \k/rri\sik{G). 

One can easily construct a graph where this bound is achieved. We just mention the main 
ideas without details. Let us consider a fc-ary tree with / levels and replace each vertex with a 
copy of Hi. The trivial algorithm is to go through the copy of H corresponding to the source 
of the tree, then in the appropriate child of it, and so on. This gives sikiG) < pak{H') < I and 
sim{H') < \k/m\l. On the other hand the method of the adversary can be the following: if a 
vertex is asked and it is not in the upper- most copy of Hi, the path won't even go into that copy 
of Hi which contains this vertex. It shows that there is equality in the previous inequalities. 
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Summarizing: 

Claim 5.1. For arbitrary m < k sim{G) < \k/rri\sik{G) and there are infinitely many G graphs 
for which equality holds. 

For larger fc, one can easily improve the trivial (and optimal) algorithm we mentioned in 
Section [2] for k = 1. At first we ask the source, all its out-neighbors, every vertex which can be 
reached from the source in a path of length two and so on. If there is an i for which we cannot 
ask every vertex which can be reached from the source in a path of length i, then we ask as 
many as we can, chosen arbitrarily. Then the answers show the beginning of the path P{E*). 
We repeat this procedure starting with the last vertex which is surely in P{E*). This algorithm 
(let us call it Algorithm A) finds the sink and the path too, using at most 1{G) questions. 

Clearly a smarter algorithm cannot be more than k times faster than this trivial one for a 
graph (without vertices of outdegree 1), as it would mean sik{G) < l{G)/k = si{G)/k. 

Now for any k we show a graph G and algorithm which can achieve this bound (depending 
on how the arbitrary vertices are chosen in the trivial algorithm). Let xi, . . . , Xki+i be vertices 
of the graph such that XjXj+i G E for every i < kl. Additionally, every Xi with i < kl has 
k — 1 children, each of them having two children, and Xki has k — 1 additional children. These 
additional vertices are all distinct, hence the graph is a tree. Algorithm A asks xi and k — 1 
of its children in the first turn. It is possible that it does not ask X2- Suppose the path P{E*) 
goes to Xki+i- If the arbitrary vertices are always chosen the worst possible way, than kl turns 
are needed (even if they are chosen smarter, at least kl/2 turns are needed). 

However, consider the following Algorithm B. At the first turn we ask Xi, . . . ,Xk- If the 
path P{E*) does not go to Xk+i, then we need to ask one more questions to finish the algorithm, 

otherwise we continue with xi+i, ,X2k, and so on. One can easily see that Algorithm B 

finishes after at most / turns. 

Summarizing (we denote by si^(G) the number of steps in which Algorithm A finds the 
sink) : 

Claim 5.2. sik{G) > siA{G)/k and there are infinitely many G graphs for which equality holds. 

Also, as we noted before, our proof that pa{G) = 1{G), if there is no vertex with outdegree 
1, can be interpreted even for graphs containing cycles, yet we could not prove such a claim 
for si{G) if G contains a cycle. Even the question is not clear in this case. We could ask for 
the sink or cycle where the flow ends, analogously to the acyclic case. On the other hand if we 
want to know every edge of the cycle, it is more similar to pa{G). A possible goal could be to 
determine the sink or the last vertex before/after creating a cycle in the flow. 

Problem 5.3. Give an efficient algorithm to determine in this sense si{G) if G contains a 
cycle. 
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The most interesting open problem is still to determine pak{Py{n)) for every k and n or at 
least to prove Conjecture 14.31 Further, our results suggest that the following might be true. 

Problem 5.4. Is it true that pak{Py{n)) = sik{Py{n)) for every k and n? 

In a paper of Sun et al. [4] a very similar problem was investigated. In their version of 
the problem, in one question we can ask for a vertex if it is on the path or not. Let us call 
the minimal number of questions for this version pa'f^{G) and si'i^{G). For pyramid graphs, 
the completely adaptive (i.e. one questions per round) version, similarly to our problem, we 
need n rounds. However, they do not regard the version when we can ask more questions 
per round. It is trivial that a question of our kind can be emulated by 3 questions of their 
kind (asking the vertex and also its 2 outgoing neighbors), thus pak{Py{n)) < 3pa'i^{Py{n)) and 
pa{Py{n)) < 3si'i.{Py{n)). It would be interesting to know more about these two new functions. 

What Sun et al. investigate is that for pa' and si', eg. for pyramid graphs algorithms using 
randomization can find the path much faster than deterministic ones. This might show one 
major difference between these two sets of problems, as in the version we regard, randomization 
does not seem to help much. One possible intuition behind this difference is that a left/right 
answer just gives a relative information, which might be completely useless to determine our 
path, whereas in their case any answer gives some information about the path, i.e. whether it 
goes through that vertex or not. 

Also, in their paper this was a major tool to give bounds to various Local Search Problems. 
It would be interesting to see whether our version has similar theoretical applications. 
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